Apparatus for enhanced electrochemical deposition

ABSTRACT

One embodiment of the invention is an apparatus for electrochemical deposition (ECD) of a metal or an alloy inside at least one opening located at a front surface of a substrate, said apparatus including: (a) an electrochemical deposition (ECD) cell adapted to contain an electrolyte, the electrolyte including plating metallic ions and at least one inhibitor additive; (b) a cathode including at least a portion of said front surface of the substrate, wherein at least one surface inside said at least one opening includes an exposed metallic surface, and wherein at least a portion of said cathode is immersed in said electrolyte; (c) at least one anode, wherein at least a portion of the at least one anode is immersed in said electrolyte; (d) a power supply adapted to generate an electroplating current through the electrolyte between said cathode and said at least one anode; and (e) means for producing a turbulent flow of the electrolyte across the front surface of the substrate, wherein the turbulent flow is capable of producing a diffusion layer thickness of less than about 50 μm at said front surface of the substrate.

This is a continuation of application Ser. No. 11/085,971, filed on Mar. 21, 2005, now U.S. Pat. No. 7,247,563, which is a continuation of application Ser. No. 10/112,332, filed on Mar. 29, 2002, now U.S. Pat. No. 6,869,515, which itself claims the benefit under 35 U.S.C. 119(e) of U.S. Provisional Application No. 60/280,325 filed on Mar. 30, 2001.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates to the field of metal or alloy electrochemical deposition (ECD) for filling narrow and high aspect ratio openings. In particular, the invention discloses new methods which enhance reliable, fast, and void-free filling of very small openings, with large aspect ratios, such as vias and trenches in semiconductor devices, thin film heads, electronic high density packages, or micro electromechanical system (MEMS) devices. The new methods are particularly effective for the so called “Damascene” and “Dual Damascene” copper interconnects, providing fast, reliable, and void-free copper filling by electroplating inside vias and trenches in the manufacture of semiconductor devices.

2. Description of the Prior Art

There are two methods to fill patterned openings by electroplating. In one method, an insulating mask such as an oxide, photoresist, or polyimide layer is patterned over a conductive metallic surface (or a “seed layer” or “plating base”), exposing the metallic surface only at the bottom of the openings. Electroplating is carried out through the openings in the insulating mask, and is confined inside the openings of the mask. Usually, following the plating, the insulating mask is removed and the seed layer (which was covered by the insulating mask) is etched away. This method is often used in the fabrication of, for example, coils and other metallic structures of thin film heads, metallic conductors in high density packages, and in MEMS devices.

In the other method, sometimes referred to as “Damascene” or “Dual Damascene”, an insulating (or dielectric) layer is first pattern-etched to form openings in it. Next, at least one metallic layer is deposited over the insulating layer to metallize its top surface (field), as well as the sidewalls and bottom surfaces of the openings. The metallic layer(s) serves as a conductive plating base (or “seed-layer”), to provide low resistive electric path for the electroplating current. Electroplating is then carried out over the entire metallized surface, including the field and inside the patterned openings. Following plating, the plated metal and any metallization (adhesion, barrier, or seed) layers above the field, as well as any excess plated metal over the openings, are removed by etching, polishing, or by chemical mechanical polishing (CMP). This results in metallic filled vias or trenches (or grooves), surrounded by a dielectric. This method is used, for example, to produce metallic interconnects in semiconductor integrated circuits devices. The present invention is related to this second method of ECD (or electroplating) over the entire surface, including over the field and inside the openings.

Usually, when using electrolytes without surface active additives, the plating rate inside the openings is slower than at the field. Due to higher electric field at the top corners of the openings, the local current density (and plating rate) is higher at the top corners, leading to faster growth and pinching-off of the top corners. This leads to deleterious voids in the filling, as shown in FIG. 1. Also, during electroplating, the relatively stagnant electrolyte inside the openings results in poor replenishment and depletion of the plating ion there. This leads to slower plating rate inside the openings than over the field, resulting in voids in the filling (cf. FIG. 1). The plating ion depletion is more severe at the bottom of the openings, and less severe near the top corners. The plating ion concentration gradient produces increasing concentration polarization, which leads to a decreasing plating rate along the depth of the opening. These inherent void-forming problems become more severe with decreasing width and increasing aspect ratio of the openings. The higher the aspect ratio (AR) of the opening, the slower the plating rate inside it, relative to the field. These problems result in poor or incomplete (voids) filling of high AR openings (cf. FIG. 1).

Aspect ratio (AR) is defined herein (cf. FIG. 1) as the ratio between height (or depth), h, of an opening and its smallest lateral dimension, W (width of a trench, or diameter of a via): AR=h/W The openings may consist of, for example, vias or trenches (or grooves) in a dielectric layer, such as used in the fabrication of interconnects in semiconductor integrated circuit devices.

The filling problems become more severe with decreasing lateral dimension W and increasing AR of the openings. For example, in today's most advanced copper filling of trenches and vias in integrated circuit interconnects, the openings may have an aspect ratio as high as 8:1 (h=1.4 μm; W=0.18 μm), and future trench and via openings will likely require W≦0.10-0.13 μm, and AR≧10:1. Reliable, void-free filling of such narrow and high AR openings imposes a great deal of difficulty.

In order to overcome the natural tendency to form voids, commercial electrolytes, such as acidic copper sulfate, usually include proprietary surface active “brightener” and/or “leveler” additives. The proprietary additives usually comprise organic compounds with functional groups containing sulfur and/or nitrogen atoms. These compounds adsorb onto growth sites of the depositing metal surface, thereby inhibiting (or suppressing) the metal deposition rate. The adsorption and its associated inhibition lead to smaller (finer) grains of the depositing metal, thus producing smoother and brighter deposits. Leveling is obtained by higher concentration of inhibitor (or additives) at protrusion tips sticking into the diffusion layer, thereby inhibiting (or suppressing) their growth. As a result, inhibition is stronger at protrusions, compared with the flat surface. In much the same way, the relatively stagnant electrolyte inside narrow openings results in poor replenishment and depletion of the inhibitor there. This depletion results in reduced inhibition and faster growth inside the openings. Due to better supply of the inhibitor at the top corners and the field, inhibition is stronger at the top corners of openings and at the field (compared with inside the openings). The reduced inhibition inside narrow openings speeds up the plating rate there (relative to the field), thus facilitating void-free filling (or “superfilling”) of narrow openings with large aspect ratios. The mechanism of superfilling narrow openings, using inhibiting additives, was proposed in several publications. For examples, see an article entitled: “Damascene copper electroplating for chip interconnects”, by P. C. Andricacos, at al. in IBM Journal of Research and Development, Vol. 42(5), pp. 567-574, 1998, and an article entitled: “Copper On-Chip Interconnections”, by P. C. Andricacos in The Electrochemical Society Interface, pp. 32-37, Spring 1999.

Clearly, in order to achieve void-free “superfilling” of narrow openings, the beneficial effect of inhibition gradients must overcome the intrinsic void-forming effects due to (a) higher electric field (and current density) at the top corners and, (b) decreasing plating rate inside openings along their depth due to depletion of the plating ion there.

As openings get narrower, and the aspect ratio increases, void-free ECD filling becomes harder and harder to control. While wider openings may fill well, narrower ones may have voids, and vice versa. For example, see an article entitled: “Factors Influencing Damascene Feature Fill Using Copper PVD and Electroplating”, by J. Reid et al. in Journal of Solid State Technology, Vol. 43(7), pp. 86-103, July 2000. Process latitude, such as the useful range of additive concentration and/or plating rate, becomes very tight and hard to control.

Prior art ECD tools and methods commonly employ relatively slow laminar (or “natural”) flow of electrolyte across the substrate's surface. For example, U.S. Pat. Nos. 6,080,291, 6,179,983, and 6,228,232 employ a perforated (or “diffusion”) plate or a porous membrane, placed between the anode and cathode (substrate), in order to achieve laminar flow across the substrate's surface. Such flow results in a relatively thick diffusion layer. The thick diffusion layer limits the useful plating rate to only about 0.3-0.4 μm/min, thereby limiting the throughput of single-wafer plating modules. In addition, prior art Cu-plated wafers usually display relatively rough, matte or semi-matte, surfaces. The rough plated surfaces include protrusions or bumps over filled openings, as well as spikes (or “balloons”) and steps (or “humps”) at boundaries between the field and patterned arrays of narrow openings (cf. FIG. 5). Such spikes, humps, or bumps cause excessive erosion and dishing during successive CMP steps and must be eliminated or minimized. Other prior art ECD tools, such as the one disclosed in U.S. Pat. No. 6,176,992 by Talieh, employ brush plating. The brush rubs the substrate's surface during plating. This is claimed to result in more planar plating after filling the openings. However, the continuous rubbing generates particulates due to wear of the brush or pads, or from the depositing metal.

Also, prior art ECD tools and methods often rely on wafer rotation to improve axial uniformity. However, unless certain strict conditions (such as no edge effects, infinite wafer's radius, infinite electrolyte volume, low plating ion concentration, and laminar flow) are satisfied, the wafer rotation creates non-uniform electrolyte flow across its surface. While electrolyte flow is slow at the center of the wafer, its (tangential) velocity increases with the radius. That velocity difference increases with rotation speed. As a result, the thickness of the diffusion layer varies as a function of the radius R (cf. 22 in FIG. 2). It has a maximum at the center of the wafer and gradually decreases along the radius, toward the edge of the wafer. The non-uniform diffusion layer produces severe non-uniformity of the plated layer along the radius of the wafer. It may also lead to deleterious voids in ECD-filled openings (such as trenches or vias) at certain radii of the wafers. These problems become more severe with increasing rotation speed and wafer diameter.

SUMMARY OF THE INVENTION

One embodiment of the invention relates to an apparatus for electrochemical deposition (ECD) of a metal or an alloy inside at least one opening located at a front surface of a substrate, said apparatus comprising: (a) an electrochemical deposition (ECD) cell adapted to contain an electrolyte, the electrolyte comprising plating metallic ions and at least one inhibitor additive; (b) a cathode comprising at least a portion of said front surface of the substrate, wherein at least one surface inside said at least one opening comprises an exposed metallic surface, and wherein at least a portion of said cathode is immersed in said electrolyte; (c) at least one anode, wherein at least a portion of the at least one anode is immersed in said electrolyte; (d) a power supply adapted to generate an electroplating current through the electrolyte between said cathode and said at least one anode; and (e) means for producing a turbulent flow of the electrolyte across the front surface of the substrate, wherein the turbulent flow is capable of producing a diffusion layer thickness of less than about 50 μm at said front surface of the substrate.

One goal of the invention is to improve void-free filling of narrow openings by enhancing inhibition at the field and top corners during electroplating, while reducing inhibition inside the openings.

Another goal is to create as large as possible concentration difference (or gradient) of the inhibitor between the field and along the depth of the openings, by depleting the inhibitor inside the openings and by enhancing supply and concentration of inhibitor(s) at the field and top corners of the openings.

Another goal is to improve plating throughput and process latitude.

Yet another goal is to improve plating uniformity and void-free filling along the radius of plated wafers.

IN THE DRAWINGS

FIG. 1 shows a schematic cross section of a high aspect ratio opening filled by prior art ECD. The undesirable void is produced by faster plating rate at the field and top corners than inside the opening.

FIG. 2 compares schematic (i.e., models of) diffusion layer thickness profiles across a wafer diameter, produced by prior art ECD tools and methods and by the inventive methods.

FIGS. 3 a-3 h show schematic (i.e., models of) comparisons between prior art ECD plating and by the enhanced void-free filling methods and apparatus of the present invention.

FIG. 4 shows SEM photographs of two cleaved samples with void-free Cu-filled trenches, using a JECD tool at 120 mA/cm², in accordance with the invention. One sample shows partial fill and the other shows complete filling of the trenches.

FIG. 5 shows a typical AFM trace taken at a boundary between the field and a patterned trench array, over a Cu surface plated by a conventional prior-art ECD tool.

FIG. 6 shows a schematic (not to scale) inhibition model of brightening and/or leveling mechanism.

FIG. 7 shows a SEM photograph of lightly etched cross section of void-free Cu-filled trenches by a JECD tool, in accordance with the invention. The trenches are ˜0.05 μm wide (at their bottom), ˜1.41 μm deep, and have aspect ratio of about 28:1.

FIG. 8 shows an anodes/jets-nozzles assembly in accordance with embodiments of the invention. Vigorous agitation is produced by high pressure jets flow.

FIG. 9 shows an anodes/jets-nozzles assembly in accordance with embodiments of the invention. Vigorous agitation is produced by high pressure jets flow and/or by mechanical means, such as wiping brushes (or pads), or by a combination of jets and wiping brushes to enhance inhibitor supply to the field.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

In general, prior art ECD tools utilize a relatively slow (“natural” or “laminar”) flow of the electrolyte across the surface of the substrate (or wafer). The slow flow results in a relatively thick stagnant diffusion (or boundary) layer. The thickness of the diffusion layer decreases with the velocity of the electrolyte across the surface of the substrate. In prior art ECD tools, used for copper filling of integrated circuits interconnects, the diffusion layer is typically about 20-100 μm thick. Also, the prior art ECD tools and methods often rely on wafer rotation to improve axial uniformity. However, unless certain strict conditions (such as no edge effects, infinite wafer's radius, infinite electrolyte volume, low plating ion concentration, and laminar flow) are satisfied, the wafer rotation creates non-uniform electrolyte flow across its surface. For example, edge effects are always present due to the wafer holding fixture, plating ion concentration is high, electrolyte volume is small, and rotation speed of more than about 80 revolutions per minute (RPM) of 300 mm wafers produces non laminar flow near the edge. As a result, prior art ECD tools and methods usually produce a mix of laminar and some turbulent flows.

FIG. 2 shows schematic (not to scale i.e., models of) distribution profiles of the diffusion layer thickness ΔX across a wafer, corresponding to prior art tools and methods 22 and the inventive methods 24, respectively. Due to wafer rotation in the prior art ECD tools, electrolyte flow at the center of the wafer is slow, but the flow (tangential) velocity increases with the radius R. As a result, the diffusion layer thickness ΔX varies as a function of the radius (cf. 22). It has a maximum at the center of the wafer and decreases along the radius, toward the edge of the wafer. The non-uniformity of the diffusion layer becomes worse with increasing rotation speed and wafer diameter. The non-uniform diffusion layer produces severe non-uniformity of the plated layer along the radius of the wafer. It may also lead to deleterious voids (cf. FIG. 1) in ECD-filled openings (such as trenches or vias), at certain radii of the wafers. These problems become more severe with increasing wafer rotation speed and diameter. In contrast, the inventive methods and apparatus produce substantially uniform vigorous agitation, resulting in much thinner and uniform diffusion layer thickness across the wafer (cf. 24). Although the inventive methods may also utilize wafer rotation (to improve axial uniformity), the rotational flow component is relatively small, compared with the other vigorous flow components.

In a preferred embodiment, uniform vigorous electrolyte agitation at the field outside openings is produced, for example, by using high pressure Jets ECD (JECD), such as disclosed in U.S. Pat. No. 5,421,987 (Jun. 6, 1995), by Tzanavaras and Cohen, incorporated herein by reference. The powerful high pressure jets agitation can readily reduce the field diffusion layer thickness by an order of magnitude, to about 2-10 μm, or less, thereby significantly enhancing replenishment of the inhibitor at the field. At the same time, the electrolyte remains essentially stagnant inside narrow openings, thus leading to local depletion of inhibitor there. In addition, the vigorous jets agitation also mitigates non-uniformity of the diffusion layer thickness along the wafer's radius, due to wafer (or anodes/jets assembly) rotation. The powerful turbulent jets flow is much more prevalent than the rotational flow.

EXAMPLE 1

Limiting current measurements (of plating current saturation at increasing cathode voltage) were taken with and without jets flow, using a jets ECD (JECD) tool similar to the one disclosed in U.S. Pat. No. 5,421,987. The acidic copper electrolyte did not include any inhibition organic additives. It consisted of 0.3M CuSO₄, 10% (v/v) H₂SO₄, and ˜60 ppm chloride ions. In one measurement, a “natural” flow of about 2 gallons per minute (GPM) was used through an inlet at the bottom of the plating cell. The observed limiting current density, i_(L), was about 55 mA/cm². In another measurement, a high pressure of 40 pounds per square inch (PSI) jets flow of 2.8 GPM was used with anode/jets assembly (AJA) rotation speed of 20 RPM, under otherwise identical conditions to the first measurement. The distance between the jet nozzles and the wafer was about 1″. A limiting current i_(L) could not be reached even at current density as high as 733 mA/cm². Further increase of the cathode voltage merely increased (linearly) the plating current without any signs of approaching saturation. This represents a factor of at least 13× higher limiting current using the jets flow, compared with the “natural” flow. The limiting current is inversely proportional to the diffusion layer thickness: i _(L) =KC/ΔX where K is a constant and C is the plating ion (bulk) concentration. These measurements indicate that the diffusion layer thickness can be reduced by at least an order of magnitude using powerful high pressure turbulent jets flow, compared with that of “natural” flow; to about 2-10 μm, or less.

Due to strong damping of the jets flow in the bath, ΔX and i_(L) are sensitive to the (inlet) jets pressure and to the distance between the jet nozzles and the substrate's surface. The higher the jets pressure and the closer the substrate's surface to the jet nozzles, the thinner is ΔX and the larger is i_(L). For reasons of system integrity, avoiding device damage by the jets, pump cost, and heat generation, a pressure range of 30-50 PSI is preferred. The distance between the substrate's surface and the jet nozzles is optimized between conflicting requirements. On one hand it is desirable to shorten this distance as much as possible, in order to mitigate bath damping and to achieve powerful jets impinging on the substrate. On the other hand, the overlapping area on the substrate covered by neighboring jets is reduced with the distance between the nozzles and the substrate. Lack of sufficient overlap between the jet cones leads to periodic non-uniformity across the substrate. The preferred optimized distance between the nozzles and the substrate is in the range of about 0.5-2.0″, and more preferably in the range of 0.75-1.5″.

FIGS. 3 a-3 h show schematic (not to scale i.e., models of) comparisons between prior art ECD filling, and the enhanced void-free ECD filling of the present invention. Inhibitor additives are used in both cases. Plating inhibition (or suppression) can be produced by many surface active materials (adsorbates) which adsorb onto the surface of the plating metal or alloy.

FIGS. 3(a) and 3(b) compare inhibitor concentration profiles 31, of prior art ECD methods and/or tools, and the present invention, respectively. Concentration profiles 31 are shown as a function of the distance X into the electrolyte, adjacent to a flat substrate surface 30. The prior art diffusion layer thickness (ΔX₁) is much larger than that produced by the inventive methods of the present invention (ΔX₂): ΔX₁>>ΔX₂ During electroplating, inhibitor is consumed at the advancing deposit surface, thereby depleting its interface concentration. The interface concentration is smaller than the bulk concentration, C_(∞). A concentration gradient of the inhibitor is thus established across the (stagnant) diffusion layer. Due to much thinner diffusion layer of the inventive methods, the inhibitor flat (or field) interface concentration (C_(F2)) of the inventive methods is much larger than that of prior art (C_(F1)): C_(F2)>>C_(F1)

FIGS. 3(c) and 3(d) compare the inhibitor concentration profiles 33 inside a narrow opening 34, and at adjacent field 32 outside the opening. FIGS. 3(c) and 3(d) correspond to conventional prior art ECD methods and/or tools, and to the inventive methods, respectively. As is well known to those skilled in the art, multiplicity of openings, such as opening 34, are etched in a dielectric (or insulating) material 36 prior to electroplating. Although the description herein relates sometimes to a single opening 34, it should be understood by those skilled in the art that the same description also applies to multiplicity of openings 34. Similarly, although the description herein refers to a single inhibitor, it should be understood by those skilled in the art that the same description also applies to multiplicity of inhibitors. Opening 34 includes sidewall surfaces 37 and bottom surface 38. Also as well-known, prior to electroplating, a metallic barrier layer and a copper seed layer (not shown) are formed by dry deposition techniques, such as by physical vapor deposition (PVD) or by chemical vapor deposition (CVD). The metallic barrier and the Cu seed layer must be continuous over the surfaces of sidewalls 37 and bottom 38 of openings 34, as well as over the field 32. The barrier layer usually comprises a refractory metal (such as Ta or Ti) or an alloy comprising a refractory metal (such as TaN_(X) or TiN_(X)). The barrier layer serves two functions: to minimize out-diffusion of Cu into the dielectric material 36 and/or into the underlying semiconductor device (not shown), and to improve adhesion between dielectric material 36 and the Cu seed layer. The Cu seed layer provides low electrical resistance path on the surface, required for the electroplating.

FIGS. 3(c) and 3(d) show that during electroplating, due to a substantially stagnant electrolyte inside opening 34, the inhibitor is greatly (or fully) depleted near the bottom 38 of opening 34. The inventive methods aim to enhance replenishment of the inhibitor at the top field surface 32. They produce much larger inhibitor's concentration at the field than that obtained by prior art: C_(F2)>>C_(F1). Inhibitor concentration variances (between the field 32 and the opening's bottom 38) are defined herein: ΔC ₁ =C _(F1) −C _(B1)  (cf. FIG. 3 c); prior art ΔC ₂ =C _(F2) −C _(B2)  (cf. FIG. 3 d); present invention Clearly, the concentration variance (or difference) obtained by the inventive methods is much larger than that obtained by prior art: ΔC₂>>ΔC₁. The inventive methods greatly enhance inhibition (and slowing-down) of electroplating rate at the field 32, relative to inside opening 34.

FIGS. 3(e) and 3(f) show plating rate profiles 35 obtained by prior art ECD methods and tools, and by the inventive methods, respectively. Due to essentially stagnant and small volume (and large area/volume ratio) of the electrolyte inside the opening, the plating depletes the inhibitor there. This depletion establishes concentration gradients of the inhibitor along the sidewalls (depth) of the opening, with maximum depletion at the bottom 38. The degree of inhibition inside opening 34 decreases gradually along its depth and has a minimum near its bottom 38. This results in maximum plating rate near bottom 38, and slower plating rate at the field 32 and top corners. Plating rate variances (between opening's bottom 38 and field 32) are defined herein: ΔV ₁ =V _(B1) −V _(F1)  (cf. FIG. 3 e); prior art ΔV ₂ =V _(B2) −V _(F2)  (cf. FIG. 3 f); present invention

Positive plating rate variance (ΔV>0) facilitates void-free filling (or “superfilling”) of narrow openings. The larger this variance, the more pronounced and prevalent the superfilling mechanism. While prior art ECD methods and tools provide relatively small (or marginal) plating rate variance, the inventive methods produce much larger variance: ΔV₂>>ΔV₁. In other words, prior art ECD methods and tools produce only slightly larger plating rate at the bottom 38 than at the field 32: V_(B1)≳V_(F1). In fact, due to inherent competing void-forming mechanisms, they often result in negative variance: ΔV₁≲0. In contrast, the inventive methods greatly enhance inhibition at the field 32 and top corners, thereby significantly slowing-down the plating rate there (relative to bottom 38): V_(B2)>>V_(F2). The inventive methods aim at producing large plating rate variance: ΔV₂>>0. They thus overcome inherent void-forming mechanisms (due to higher electric field at the top corners and due to depletion of plating ions inside openings). These inherent void-forming mechanisms produce negative plating rate variance (ΔV<0), leading to pinching-off of the top corners, and to void-formation. The inventive methods significantly enhance void-free superfilling by generating a large and positive ΔV₂. This large variance also facilitates wider process latitude, such as wider useful ranges of inhibitor concentration and plating rates.

FIGS. 3(g) and 3(h) show filling profiles 39 obtained by prior art ECD methods and tools, and by the inventive methods, respectively. While prior art methods and tools produce conformal (or marginally filling) profiles which often lead to seam-voids, the inventive methods produce much more pronounced superfilling profile. Robust prevalent superfilling is the result of significantly slower plating rate at the field 32 and top corners, with a pronounced maximum near bottom 38, as shown in FIG. 3(f). Such a plating rate profile facilitates progressive sealing of narrow openings without seam-voids (“superfilling”). This sealing proceeds like a zipper, from bottom to top, where most of the filling is due to lateral sidewall growth.

EXAMPLE 2

FIG. 4 shows scanning electron micrographs (SEM) of two cleaved samples with Cu-superfilled trenches, in accordance with a preferred embodiment of the invention. Both samples were plated by a jets ECD (JECD) tool, similar to the one disclosed in U.S. Pat. No. 5,421,987. Both samples were plated with current density of 120 mA/cm² (plating rate of about 2.8 μm/min). FIG. 4(a) shows a sample in which the plating was terminated prior to completion (partially filled trenches), thereby revealing the superfilling profile. The growth profiles on the bottom and sidewalls in FIG. 4(a) correspond well with the superfill model FIG. 3(h). The plating rate is fastest at the bottom (maximum depletion and minimum inhibition) and is slowest at the top (minimum depletion and maximum inhibition) of the openings. It is the plating rate gradients, along the depth of the openings, which facilitate void-free filling of narrow openings. The model and the experimental results imply that, in very narrow openings (with large aspect ratios), most of the filling is due to lateral growth components, substantially normal to the sidewalls. This explains why narrower openings fill earlier than wider openings. Almost one half of the upper width of the opening has to grow laterally, in order to seal the opening. The sealing of the opening proceeds like-a-zipper, from bottom to top. FIG. 4(b) shows a sample in which the plating was terminated some time after complete filling of the trenches. Note the flat smooth Cu-plated surface and the lack of any bumps above filled openings. No plating voids were found in either of these samples. The jets pressure was 40 PSI, and rotation speed of the anodes/jets assembly (AJA) was 20 RPM. The distance between the wafer and the nozzles was about 1″, and a backing flat plastic plate was placed at the back of the wafer in order to avoid wafer buckling under the powerful jets. The acidic copper sulfate electrolyte was similar to the one described in Example 1, and further included proprietary two-component (“Gleam-PPR”) additive system from Shipley-Lea Ronal Company.

The powerful jets agitation enhances replenishment of inhibitor at the field, while inside the narrow openings the electrolyte remains essentially stagnant. The impinging powerful jets create vigorous turbulent agitation at the wafer's surface, thereby significantly reducing the diffusion layer thickness, ΔX₂, as seen in FIG. 3(b). During electroplating, the faster plating rate at the bottom 38 facilitates the desirable void-free filling (or “superfilling”) profile, as shown in FIGS. 3(h) and 4(a).

The powerful jets agitation provides another important advantage. It facilitates significant increase of the plating rate without “burning” the deposit. It was found that the plating rate could be safely increased to 2.8 μm/min, without any deleterious effects. This is about 8× faster than typical prior art plating rate of about 0.35 μm/min. It facilitates more than double the throughput per module, at no extra cost. In fact, it was found that the surface becomes brighter by increasing the plating rate. At 2.8 μm/min, wafers appear fully bright (cf. FIG. 4(b)). However, the surface brightness and roughness of the plated Cu layer is also a strong function of the seed layer. Some seed layers result in significantly rougher plated Cu than others.

FIG. 5 shows an atomic force microscope (AFM) trace, taken over the surface of a wafer which was Cu-plated using a conventional prior-art tool (Novellus Systems, Inc). The AFM trace was taken over a border between flat field and an array of trenches of 0.35 μm/0.35 μm lines/spaces. The wafer was plated to a nominal thickness of 1.5 μm. The AFM trace shows a large spike (˜520 nm) and an elevation step (˜250 nm) at the border.

EXAMPLE 3

Using a JECD tool as in Example 2, two wafers were plated at low and high plating rates. All other plating parameters (or variables) were identical to those in Example 2. As in FIG. 5, the wafers had patterned arrays of 0.35/0.35 μm lines/spaces, and were plated to a nominal thickness of 1.5 μm. The first sample was plated at low current density (15 mA/cm² or ˜0.35 μm/min), and the second sample was plated at high current density (120 mA/cm² or ˜2.8 μm/min). While the lower plating rate produced a small step (or hump) of about 140 nm at the field/array border, the sample plated at the higher rate revealed no transition spike or step at all. For comparison, FIG. 5 shows that a wafer plated by a prior art tool had a large spike (˜520 nm) and elevation step (˜250 nm) at the border. Surface roughness of JECD plated samples was measured by AFM. While the mean surface roughness (Ra) of a low current (15 mA/cm²) sample was 11.2 nm, that of a sample plated at the higher current (120 mA/cm²) was only 7.3 nm.

JECD plating did not require any additional (third component) “leveler” organic additive, or the use of a complex pulse or periodic reversal pulse plating, in order to eliminate the spikes and steps common in prior art ECD plating. As disclosed in an article by Reid et al. in Journal of Solid State Technology, Vol. 43(7), pp. 86-103, July 2000, and in an article by Mikkola et al. in 2000 International Interconnect Technology Conference (IITC), pp. 117-119, June 2000, addition of a third component “leveler” additive greatly complicates the required bath analysis and control. It may also result in top center voids and poor filling of larger features. As disclosed in an article by Gandikota et al. in 2000 International Interconnect Technology Conference (IITC), pp. 239-241, June 2000, and in an article by Hsie et al. in 2000 International Interconnect Technology Conference (IITC), pp. 182-184, June 2000, pulse plating, and in particular periodic reversal (PR) plating, slows-down the throughput and further complicates the required control. It may also result in larger grains, rougher surface, and longer self-anneal time of the plated Cu films.

FIG. 6 shows a schematic inhibition model explaining the improved JECD leveling and brightness with increasing plating rate. At low current density (15 mA/cm²), there is only small depletion (ΔC₁₅) of the inhibitor at the wafer's interface. As a result, there is very little or no inhibition differentiation between a growing protrusion and the flat surface. Due to higher concentration of the plating ions ahead of the flat interface, protrusions (after exceeding a critical size) can continue to grow faster than the flat surface. This leads to amplification of the protrusions and surface roughening. At high plating rate, the high current density (105 mA/cm²) creates larger (or deeper) depletion (ΔC₁₀₅>>ΔC₁₅) of the inhibitor at the (flat) wafer interface. The tip of a growing protrusion “sees” significantly larger concentration of the inhibitor than the flat surface. Protrusions which stick into the diffusion layer are suppressed (or inhibited) by the higher concentration of the inhibitor ahead of the flat surface. The flat surface, on the other hand, “sees” deeper depletion (or lower concentration) of inhibitor as the current density increases, thus enhancing its deposition rate (relative to protrusions). As described in an article by Cohen et al. in 2000 Proc. 17^(th) Intl. VLSI Multilevel Interconnect Conference (VMIC), pp. 21-26, June 2000, and in the article by Hsie et al., the enhanced suppression of protrusions at higher current densities leads to smoother and brighter deposits.

The common prior art humps or bumps are due to coalescence of individual micro-bumps (above the top corners of openings) into larger bumps or humps, associated with insufficient leveling or brightening. The coalesced bumps or humps continue to amplify and grow faster than the flat field, since they “see” larger concentration of the plating ions ahead of the flat field. The inventive methods improve leveling, thereby reducing or eliminating the bumps and humps. The most important factor for leveling (for a given chemistry of additives and their bulk concentrations) is to create a large concentration gradient across the diffusion layer (ΔC_(inh)/Δx). In accordance with the invention, this is achieved by increasing the current density (larger ΔC_(inh)) and by the powerful vigorous jets agitation (smaller ΔX).

As openings get narrower, the ratio of surface area (A) to volume (V) of the opening becomes larger. This ratio is reciprocal to W, the diameter of vias or width of trenches: A/V∝1/W A larger ratio A/V results in faster depletion of the inhibitor inside the openings. In other words, it takes less time (keeping all other parameters the same) to establish the inhibitor depletion gradients inside the opening. This, in turn, results in thinner transitional conformal growth on the bottom and sidewalls, prior to superfill commencement. As described by the articles of Andricacos, and Andricacos et al., growing conformal and non-conformal (or anti-conformal) layers on the sidewalls lead to deleterious seam-voids in the center of trenches or vias (cf. FIG. 1). Their thickness must be minimized in order to achieve void-free filling. Thus, it is actually easier to deplete the inhibitor inside narrower openings! However, even complete depletion inside the openings does not guarantee void-free filling. The latter requires both, depletion of inhibitor inside the opening and sufficient supply of it at the top field. Without the latter, ΔC_(inh) between the field and the inside of the opening becomes too small to enable the superfill mechanism. This is where the inventive methods become so beneficial. The powerful jets of JECD and/or brush wiping enhance considerably the supply of inhibitor to the field (compared with prior art tools and methods), thereby significantly increasing ΔC_(inh). The larger ΔC_(inh) produced by the inventive methods enables superfilling of even the narrowest (as well as the wider) openings, with wide process latitude.

EXAMPLE 4

FIG. 7 shows a scanning electron micrograph (SEM) of a cleaved sample plated by a JECD tool, in accordance with the invention. The photo shows void-free filled trenches which were only about 0.05 μm wide (at their bottom), about 1.41 μm deep, and had aspect ratio of about 28:1. The plating current density was 30 mA/cm². All other plating parameters were the same as in Examples 2 and 3. These are currently the narrowest (with largest AR) void-free ECD Cu-filled trenches ever demonstrated.

FIG. 8 shows a preferred embodiment enhancing inhibitor supply to the field, using a JECD apparatus similar to the one disclosed in U.S. Pat. No. 5,421,987. Anode segments 82 and jet nozzles 84 are assembled in a manner to provide high pressure jets producing uniform powerful and vigorous turbulent flow of electrolyte over the surface of a facing substrate (not shown). The nozzles 84 are assembled on a platen 86. Anode segments 82 may consist of a single copper sheet or a multiplicity of copper pieces electrically connected together. In order to improve axial uniformity, either the anodes/jets assembly (AJA) rotates and the substrate is held stationary, or the substrate rotates and the AJA is held stationary. Alternatively, both the AJA and the substrate rotate in opposite directions. The plating cell configuration may be horizontal or vertical. In the horizontal configuration the substrate (wafer) is either facing down or is facing up the AJA. In the vertical configuration both the wafer and the AJA face each other in a vertical configuration. The powerful jets produce significantly enhanced supply of inhibitor to the field, while maintaining substantially stagnant electrolyte inside narrow openings.

FIG. 9 shows alternative embodiments in which enhanced inhibitor supply to the field is produced by wiping brushes (or pads, or wiper blades) 80. During electroplating, the brushes and/or the substrate move or rotate relative to, or across, each other in a manner to provide mechanical wiping between the brushes (or pads, or wiper blades) and the surface of the substrate (not shown). Although the description herein relates sometimes to brushes (or pads, or blades), it should be understood by those skilled in the art that the same description also applies to a single brush (or a pad, or a wiper blade). The mechanical wiping removes depleted electrolyte from the field surface, thereby enhancing replenishment of fresh (with undepleted inhibitor) electrolyte to the field. At the same time, electrolyte inside narrow openings remains essentially stagnant and unremoved by the wiping brush (or pad, or wiper blade), thus leading to local depletion of inhibitor there. The wiping pad (or blade, or brush) does not have to actually make continuous contact with the surface of the substrate. It is sufficient to move it (or to move the substrate relative to it) within close proximity to the substrate's surface, in order to obliterate the relatively thick diffusion layer established prior to the wiping action. For example, a wiping blade (or pad or brush) within a distance of about 5-50 μm from the substrate surface is sufficient to wipe out and obliterate the diffusion layer from the field and replenish it with fresh solution containing full inhibitor concentration. Non-contacting, or slightly contacting, wipe has the advantage of avoiding particulate formation due to rubbing between the wipe and the substrate's surface. A non-contacting wipe can be obtained, for example, by hydrodynamically floating the wipe over the substrate's surface, similar to floating magnetic heads over rotating disks, or by shaping the wipe to float as an hydrofoil or hydroplane.

In one embodiment, brushes (or pads, or blades) 80 alone, without jet-nozzles 84, produce significantly enhanced supply of inhibitor to the field, while maintaining substantially stagnant electrolyte, containing depleted inhibitor, inside narrow openings. The relative wiping motion does not have to be rotational. It may also be linear (not shown).

In another embodiment, both high pressure jets (using jet-nozzles 84) and wiping brushes (or pads, or blades) 80 are combined together to produce the benefits of both methods. They produce significantly enhanced supply of inhibitor to the field, while maintaining substantially stagnant electrolyte (containing depleted inhibitor) inside narrow openings. The relative wiping motion does not have to be rotational. It may also be linear (not shown).

In yet another embodiment, ultrasonic or megasonic agitation (not shown), with or without brushes 80 and/or jet-nozzles 84, produces significantly enhanced supply of inhibitor to the field, while maintaining substantially stagnant electrolyte inside narrow openings.

The inventive methods increase the concentration gradients of the inhibitor, ΔC_(inh)/h, along the depth (h) of the opening, thus producing larger inhibition gradient from the field to the bottom of the opening. The larger this gradient, the more effective and prevalent the superfill mechanism. At large enough gradient, even the narrowest opening can be filled without voids. At the same time, increasing this gradient also facilitates wider process latitude. For example, utilizing a JECD tool, it was possible to obtain simultaneous void-free filling of very narrow, as well as wide openings, while using wide range of plating rates (0.35-2.8 μm/min). Similarly, using a JECD tool, it was possible to vary the nominal additive concentration by more than 100%, without deleterious effects.

Other embodiments of the invention provide for one or more of the following: (a) increasing the global plating rate (in order to facilitate depletion of inhibitor inside openings); (b) narrowing the openings and/or increasing their aspect ratio (in order to facilitate depletion of inhibitor inside openings); (c) decreasing the bulk inhibitor concentration (in order to facilitate depletion of inhibitor inside openings); and (d) increasing adsorption strength (and inhibition effectiveness) by selecting at least one powerful inhibitor (or adsorbate).

However, it should be understood by those skilled in the art that there are optimal ranges for these variables which may intricately depend on each other. For example, while increasing the global plating rate facilitates depletion of inhibitor inside the openings, it also increases depletion of the inhibitor at the field. This may in turn require higher inhibitor bulk concentration. Therefore, the optimal range of plating rate is strongly dependent upon inhibitor concentration. Similarly, although depletion is faster inside narrower openings (with larger AR), inherent void-forming effects (due to higher electric field at the top corners and plating ion depletion inside the openings) are stronger with such openings. Also, while decreasing bulk inhibitor concentration helps depletion inside the openings, it may result in insufficient concentration at the field, thereby adversely affecting void-free filling. Thus, too low or too high inhibitor bulk concentration or global plating rate may result in insufficient inhibitor gradients for successful void-free filling. Too low inhibitor concentration may result in insufficient field concentration, thereby producing too small inhibitor gradients. Too high inhibitor bulk concentration may result in insufficient depletion inside the openings, thereby leading to too small inhibitor gradients. Similarly, too low plating rate may not be sufficient to deplete the inhibitor inside the openings, while too high plating rate may deplete too much the field. Therefore, the optimal ranges of these variables are intricately dependent on each other. They must be optimized in concert with each other.

It should be understood that the above-described embodiments can be used to fabricate any number of devices including, and without limitation, metallic interconnects in semiconductor integrated circuit devices, thin film heads, micromachined Microelectromechanical Systems (MEMS) devices, or interconnects in high density electronic packages (such as chip scale and wafer scale packaging).

Those skilled in the art will recognize that the foregoing description has been presented for the sake of illustration and description only. As such, it is not intended to be exhaustive or to limit the invention to the precise form disclosed. 

1. An apparatus for electrochemical deposition (ECD) of a metal or an alloy inside at least one opening located at a front surface of a substrate, said apparatus comprising: an electrochemical deposition (ECD) cell adapted to contain an electrolyte, the electrolyte comprising plating metallic ions and at least one inhibitor additive; a cathode comprising at least a portion of said front surface of the substrate, wherein at least one surface inside said at least one opening comprises an exposed metallic surface, and wherein at least a portion of said cathode is immersed in said electrolyte; at least one anode, wherein at least a portion of the at least one anode is immersed in said electrolyte; a power supply adapted to generate an electroplating current through the electrolyte between said cathode and said at least one anode; and means for producing a turbulent flow of the electrolyte across the front surface of the substrate, wherein the turbulent flow is capable of producing a diffusion layer thickness of less than about 50 μm at said front surface of the substrate.
 2. The apparatus of claim 1 wherein (a) said means for producing a turbulent flow comprise electrolyte jets adapted to impinge on the substrate at a substantially normal direction to the front surface of the substrate, wherein the jets and/or the substrate are adapted to rotate about the normal direction, (b) said diffusion layer thickness is less than about 25 μm, and (c) said power supply is adapted to generate an average electroplating current density of at least 20 mA/cm².
 3. The apparatus of claim 2 wherein the power supply is adapted to generate an average electroplating current density of at least 30 mA/cm².
 4. The apparatus of claim 2 wherein the power supply is adapted to generate an average electroplating current density of at least 60 mA/cm².
 5. The apparatus of claim 2 wherein the power supply is adapted to generate an average electroplating current density of at least 120 mA/cm².
 6. The apparatus of claim 2 wherein said means for producing a turbulent flow further comprise ultrasonic or megasonic agitation of the electrolyte.
 7. The apparatus of claim 1 wherein (a) said means for producing a turbulent flow comprise one or more wiping blades, or one or more wiping pads, or one or more wiping brushes, adapted to move relative to the substrate, and/or the substrate adapted to move relative to one or more wiping brushes, or relative to one or more wiping pads, or relative to one or more wiping blades, (b) said diffusion layer thickness is less than about 25 μm, and (c) said power supply is adapted to generate an average electroplating current density of at least 20 mA/cm².
 8. The apparatus of claim 7 wherein said means for producing a turbulent flow comprise one or more non-contacting wiping blades adapted to move relative to the substrate, and/or the substrate adapted to move relative to one or more non-contacting wiping blades.
 9. The apparatus of claim 8 wherein said power supply is adapted to generate an average electroplating current density of at least 30 mA/cm².
 10. The apparatus of claim 8 wherein said means for producing a turbulent flow further comprise electrolyte jets adapted to impinge on the substrate at a substantially normal direction to the front surface of the substrate.
 11. The apparatus of claim 10 wherein said means for producing a turbulent flow further comprise ultrasonic or megasonic agitation of the electrolyte.
 12. The apparatus of claim 8 wherein the relative movement of the one or more non-contacting wiping blades, and/or the relative movement of the substrate, comprises a linear motion.
 13. The apparatus of claim 12 wherein the power supply is adapted to generate an average electroplating current density of at least 30 mA/cm².
 14. The apparatus of claim 12 wherein the power supply is adapted to generate an average electroplating current density of at least 60 mA/cm².
 15. The apparatus of claim 12 wherein the power supply is adapted to generate an average electroplating current density of at least 120 mA/cm².
 16. An apparatus for electrochemical deposition (ECD) of a metal or an alloy inside at least one opening located at a front surface of a substrate, said apparatus comprising: an electrochemical deposition (ECD) cell adapted to contain an electrolyte, the electrolyte comprising plating metallic ions and at least one inhibitor additive, said plating metallic ions having a concentration; a cathode comprising at least a portion of said front surface of the substrate, wherein at least one surface inside the at least one opening comprises an exposed metallic surface, and wherein at least a portion of said cathode is immersed in said electrolyte; at least one anode, wherein at least a portion of the at least one anode is immersed in said electrolyte; a power supply adapted to generate an electroplating current through the electrolyte between said cathode and said at least one anode; and means for producing a turbulent flow of the electrolyte across the front surface of the substrate, wherein said turbulent flow and said concentration are capable of producing an average limiting current density larger than about 500 mA/cm².
 17. The apparatus of claim 16 wherein (a) said means for producing a turbulent flow comprise electrolyte jets adapted to impinge on the substrate at a substantially normal direction to the front surface of the substrate, wherein the jets and/or the substrate are adapted to rotate about the normal direction, and (b) said power supply is adapted to generate an average electroplating current density of at least 20 mA/cm².
 18. The apparatus of claim 17 wherein the power supply is adapted to generate an average electroplating current density of at least 30 mA/cm².
 19. The apparatus of claim 17 wherein the power supply is adapted to generate an average electroplating current density of at least 60 mA/cm².
 20. The apparatus of claim 16 wherein (a) said means for producing a turbulent flow comprise one or more wiping blades, or one or more wiping pads, or one or more wiping brushes, adapted to move relative to the substrate, and/or the substrate adapted to move relative to one or more wiping brushes, or relative to one or more wiping pads, or relative to one or more wiping blades, and (b) said power supply is adapted to generate an average electroplating current density of at least 20 mA/cm².
 21. The apparatus of claim 20 wherein said means for producing a turbulent flow comprise one or more non-contacting wiping blades adapted to move relative to the substrate, and/or the substrate adapted to move relative to one or more non-contacting wiping blades.
 22. The apparatus of claim 20 wherein the relative movement of the one or more non-contacting wiping blades, and/or the relative movement of the substrate, comprises a linear motion, and wherein the turbulent flow is capable of producing a diffusion layer thickness of less than about 50 μm at said front surface of the substrate.
 23. The apparatus of claim 22 wherein said power supply is adapted to generate an average electroplating current density of at least 30 mA/cm².
 24. The apparatus of claim 22 wherein said power supply is adapted to generate an average electroplating current density of at least 60 mA/cm². 